Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Front Artif Intell ; 5: 744755, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35757298

RESUMEN

The use of machine learning (ML) in life sciences has gained wide interest over the past years, as it speeds up the development of high performing models. Important modeling tools in biology have proven their worth for pathway design, such as mechanistic models and metabolic networks, as they allow better understanding of mechanisms involved in the functioning of organisms. However, little has been done on the use of ML to model metabolic pathways, and the degree of non-linearity associated with them is not clear. Here, we report the construction of different metabolic pathways with several linear and non-linear ML models. Different types of data are used; they lead to the prediction of important biological data, such as pathway flux and final product concentration. A comparison reveals that the data features impact model performance and highlight the effectiveness of non-linear models (e.g., QRF: RMSE = 0.021 nmol·min-1 and R2 = 1 vs. Bayesian GLM: RMSE = 1.379 nmol·min-1 R2 = 0.823). It turns out that the greater the degree of non-linearity of the pathway, the better suited a non-linear model will be. Therefore, a decision-making support for pathway modeling is established. These findings generally support the hypothesis that non-linear aspects predominate within the metabolic pathways. This must be taken into account when devising possible applications of these pathways for the identification of biomarkers of diseases (e.g., infections, cancer, neurodegenerative diseases) or the optimization of industrial production processes.

2.
Sci Rep ; 10(1): 13446, 2020 08 10.
Artículo en Inglés | MEDLINE | ID: mdl-32778715

RESUMEN

Metabolic pathway modeling plays an increasing role in drug design by allowing better understanding of the underlying regulation and controlling networks in the metabolism of living organisms. However, despite rapid progress in this area, pathway modeling can become a real nightmare for researchers, notably when few experimental data are available or when the pathway is highly complex. Here, three different approaches were developed to model the second part of glycolysis of E. histolytica as an application example, and have succeeded in predicting the final pathway flux: one including detailed kinetic information (white-box), another with an added adjustment term (grey-box) and the last one using an artificial neural network method (black-box). Afterwards, each model was used for metabolic control analysis and flux control coefficient determination. The first two enzymes of this pathway are identified as the key enzymes playing a role in flux control. This study revealed the significance of the three methods for building suitable models adjusted to the available data in the field of metabolic pathway modeling, and could be useful to biologists and modelers.


Asunto(s)
Glucólisis/fisiología , Redes y Vías Metabólicas/fisiología , Simulación por Computador , Entamoeba histolytica/metabolismo , Cinética , Modelos Biológicos , Modelos Teóricos , Fenómenos Físicos
3.
PLoS One ; 14(5): e0216178, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31067238

RESUMEN

The selection of optimal enzyme concentration in multienzyme cascade reactions for the highest product yield in practice is very expensive and time-consuming process. The modelling of biological pathways is a difficult process because of the complexity of the system. The mathematical modelling of the system using an analytical approach depends on the many parameters of enzymes which rely on tedious and expensive experiments. The artificial neural network (ANN) method has been successively applied in different fields of science to perform complex functions. In this study, ANN models were trained to predict the flux for the upper part of glycolysis as inferred by NADH consumption, using four enzyme concentrations i.e., phosphoglucoisomerase, phosphofructokinase, fructose-bisphosphate-aldolase, triose-phosphate-isomerase. Out of three ANN algorithms, the neuralnet package with two activation functions, "logistic" and "tanh" were implemented. The prediction of the flux was very efficient: RMSE and R2 were 0.847, 0.93 and 0.804, 0.94 respectively for logistic and tanh functions using a cross validation procedure. This study showed that a systemic approach such as ANN could be used for accurate prediction of the flux through the metabolic pathway. This could help to save a lot of time and costs, particularly from an industrial perspective. The R-code is available at: https://github.com/DSIMB/ANN-Glycolysis-Flux-Prediction.


Asunto(s)
Glucólisis , Análisis de Flujos Metabólicos , Redes Neurales de la Computación , Algoritmos , Análisis de Flujos Metabólicos/métodos , Redes y Vías Metabólicas , NAD/metabolismo
4.
BMC Bioinformatics ; 19(1): 382, 2018 Oct 16.
Artículo en Inglés | MEDLINE | ID: mdl-30326841

RESUMEN

BACKGROUND: Connecting the dots between the protein sequence and its function is of fundamental interest for protein engineers. In-silico methods are useful in this quest especially when structural information is not available. In this study we propose a mutant library screening tool called iSAR (innovative Sequence Activity Relationship) that relies on the physicochemical properties of the amino acids, digital signal processing and partial least squares regression to uncover these sequence-function correlations. RESULTS: We show that the digitalized representation of the protein sequence in the form of a Fourier spectrum can be used as an efficient descriptor to model the sequence-activity relationship of proteins. The iSAR methodology that we have developed identifies high fitness mutants from mutant libraries relying on physicochemical properties of the amino acids, digital signal processing and regression techniques. iSAR correlates variations caused by mutations in spectra with biological activity/fitness. It takes into account the impact of mutations on the whole spectrum and does not focus on local fitness alone. The utility of the method is illustrated on 4 datasets: cytochrome P450 for thermostability, TNF-alpha for binding affinity, GLP-2 for potency and enterotoxins for thermostability. The choice of the datasets has been made such as to illustrate the ability of the method to perform when limited training data is available and also when novel mutations appear in the test set, that have not been featured in the training set. CONCLUSION: The combination of Fast Fourier Transform and Partial Least Squares regression is efficient in capturing the effects of mutations on the function of the protein. iSAR is a fast algorithm which can be implemented with limited computational resources and can make effective predictions even if the training set is limited in size.


Asunto(s)
Análisis de Fourier , Ingeniería de Proteínas/métodos , Proteínas/química , Humanos
5.
Protein Eng Des Sel ; 27(10): 375-81, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25169579

RESUMEN

In directed evolution experiments, it is at stake to have methods to screen efficiently the mutant libraries. We propose a web-based tool that implements an established in silico method for the rational screening of mutant libraries. The method, known as ProSAR, attempts to link sequence data to activity. The method uses statistical models trained on small experimental datasets provided by the user. These can integrate potential epistatic interactions between mutations and be used in many diverse biological contexts. It drastically improves the search for leading mutants. The tool is freely available to non-commercial users at http://bo-protscience.fr/prosar/.


Asunto(s)
Bases de Datos de Proteínas , Internet , Proteómica/métodos , Programas Informáticos , Algoritmos , Evolución Molecular Dirigida , Biblioteca de Genes , Mutación , Análisis de Regresión
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...